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HOMOLOGOUS RECOMBINATION IN MISMATCH REPAIR INACTIVATED EUKARYOTIC CELLS 



BACKGROUND OF THE INVENTION 
Introduction 

The introduction of specific modifications in the prokaryotic and eukaryotic genome is a 
powerful tool in studying gene function both at the level of individual cells and in the context 
of a complete organism. In addition, modification of specific genes may result in the 
generation of industrially and medically important organisms, whereas correction of defective 
alleles in eukaryotic cells may provide a substantial step forward in the development of 
somatic gene-therapy protocols. 

Tftp method of genetic modification relies on the ability of virtually every cell type to 
e&hange DNA sequences with a high degree of nucleotide' sequence similarity by a 
pjpcess which is called homologous recombination (1). Briefly, the method of genetic 
modification involves the generation of a so-called targeting construct, a DNA sequence that 
i|4argely identical to the specific chromosomal locus to be modified, but differing from this 
lotus by specific modifications. These modifications can be as small as the deletion, 
ifgertion or substitution of a single base-pair, or be as large as the deletion or insertion of 
tip's of kilobasepairs. On entry of the targeting construct into the cell, exchange of 
Sequences flanking the modification with their chromosomal counterparts, will result in the 
introduction of the- modification into the recipient chromosome. 

The efficiency of homologous recombination in both prokaryotes and eukaryotes strongly 
depends on complete sequence identity of exchanging DNA strands (2-5). Thus, sequence 
dissimilarities as small as 0.5% can already strongly impede homologous recombination. In 
several bacterial species, Escherichia coth Salmonella typhimurium and Streptococcus 
pneumoniae as well as in the yeast Saccharomyces cerevisiae it has been unequivocally 
shown that the DNA mismatch repair system is responsible for suppressing recombination 
berween homologous but nonidentical DNA sequences (6-8). 

We nave now demonstrated that in mammaiian cells deficient for the DNA mismatch repair 
gene Msh2 t homologous recombination has lost the requirement fcr complete sequence 
identity between exchanging DNA sequences. This finding provides a new method for 
modifying the eukaryotic genome using DNA targeting constructs which substantially differ 
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from the target locus in the region where recombination takes place by genetically and/or 
functionally inactivating the cell's mismatch correction system. 

The method obviates the. requirement for DNA targeting constructs that are largely identical 
to the target locus, thus allowing the efficient genetic modification of both somatic cells and 
the germ line of outbred organisms. This may provide a route to somatic gene therapy and 
the modification of eukaryotic species of which inbred strains are not easily available. Also, 
genomic sequences can be replaced by small oligonucleotides carrying one or more 
basepair alterations or by large chromosomal sequences derived from other species. The 
method can also be used to generate large deletions by intra- or extrachromosoma! 
homologous recombination between repeated but diverged sequences. 

Genetic modification of mice 

The introduction of specific genetic alterations into the germ line of mice was made possible 
by a combination of new techniques in molecular biology and embryology that have 
become available during the last ten years (9-11). The method entails the introduction of a 
planned genetic modification in embryonic stem (ES) cells by homologous recombination. 
ES cells are originally derived from the inner cell mass of 3.5 day old pre-implantation 
embryos and can be maintained by in vitro culture as immortalized cell lines, retaining the 
undifferentiated state under appropriate culture conditions. At present, a number of ES cell 
lines are available. ES cells injected into the btastocoel of 3.5 day blastocyst-stage 
embryos can efficiently compete with the inner-cell-mass ceils of the recipient blastocyst in 
embryonic development thus generating a chimeric mouse consisting of cells derived from 
the recipient blastocyst and the injected ES cells. Should the in vitro modified ES cells 
contribute to the germ line of chimeric animals, the ES ceil genome can be transmitted to the 
next generation giving animals carrying the introduced modification in all their cells. 
Intercrossing of such animals reveals the phenotypic consequences of homozygosity of the 
modified gene. 

The most commonly introduced genetic modification in ES cells is gene disruption leading to 
gene inactivation. This allows to study gene function by analyzing the consequences of the 
absence of a particular gene in the context of a complete organism. Also, many hereditary 
human diseases result from hemizygosity of specific genes [e.g. cancer predisposition 

syndromes like hereditary nonpoiyposis colorectal cancer (12), Li Fraumeni syndrome (13), 
retinoblastoma (14)]. The generation of mouse models for such diseases by disruption of 
the mouse homologs of the genes involved provides an invaluable tool for studying 
hereditary diseases in an experimental setting. 
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Gene disruption in ES cells 

Thelnactivation of a specific gene in ES cells via homologous recombination starts with the 
preparation of a DNA targeting construct (15). Two types of targeting constructs can be 
used. In a replacement-type vector, a drug-resistance marker gene (conferring to the ceil 
resistance to drugs like G418, Hygromycin B, Puromycin, Histidinol) disrupts a sequence 
which is homologous to a sequence in or around the target gene in the recipient genome; in 
an insertion-type vector the marker gene flanks the homologous sequence. The targeting 
construct is introduced into ES cells by electroporation (or alternatively by Ca-Phosphate 
precipitation, lyposome-mediated DNA transfer or micro-injection) and cells are selected for 
stable integration of the targeting construct into the recipient genome by growth in medium 
containing the appropiate drug. Drug-resistant colonies are the result of either one of two 
events: integration of the targeting construct at a random site in the genome or integration of 
the marker gene in the target locus via homologous recombination between the flanking 
sequences and their chromosomal counterparts. The repiacement-type vector integrates by 
homologous recombination on both sides of the marker gene; the insertion-type vector 
integrates by a single homologous recombination event leading to duplication of the region 
of homology. Thus, the marker gene serves two purposes: ft allows selection of cells that 
have taken up the targeting construct and, on integration via homologous recombination, it 
will disrupt the target gene thereby modifying its function. 

Distinguishing ES cell clones resulting from homologous recombination from those resulting 
from random integration, requires the DNA of individual clones to be analysed by Southern 
hybridisation or the polymerase chain reaction. 

Unfortunately, in many targeting experiments, random integration was often found to be far 
more efficient than homologous recombination and also large variations in targeting efficiency 
were observed for different genes (16). In this respect, mammalian cells differ from bacteria 
and lower eukaryotes Irke yeast (17), Leishmania major (18) or Trypanosoma brucei (19), 
where integration of exogenous DNA into the recipient genome exclusively or predominantly 
occurs via homologous recombination. To date, three factors clearly affecting the recovery of 
homologous recombinants in mammalian cells have been identified. First, the frequency of 
homologous recombination increases substantially with the total length of the homologous 
sequences up to 14 kilobase-pairs (20). Second, the expression level of the marker gene at 
the target locus affects the frequency of recovery of homologous recombinants: low 
expression may lead to loss of homologous recombinants, whereas high expression may 
allow selection of homologous recombinants at an elevated drug concentration (21). Third, 
sequence dissimilarities between the targeting construct and the chromosomal target locus 
strongly suppress the efficiency of homologous recombination (5). 
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High efficiency targeting with isogenic DNA constructs 

The suppression of homologous recombination in ES cells by small sequence dissimilarities 
became clear by a gene targeting experiment aimed at disrupting the Retinoblastoma gene 
with a neomycin resistance marker gene. Two targeting constructs were prepared carrying 
the neomycin resistance marker embedded in 10.5 kb of Rb sequence. In one construct the 
Rb sequence was derived from mouse strain 129, and was therefore identical to the 
corresponding chromosomal locus in the ES cells, which were also derived from mouse 
strain 129. In the other construct, the Rb sequence was derived from mouse strain BALB/c. 
The two constructs, designated 129Rb-neo and B/cRb-neo are described in the 
accompanying publication Proc. NaiL Acad. ScL USA, VoL 89, pp. 5128-5132 on pages 
5128 and 5129, Fig. 1,The two constructs contained corresponding Rb sequences and 
were therefore largely similar. However, they differed approximately 0.6% at the nucleotide 
level (which corresponds to the level of sequence polymorphism found in the human 
population), Thus, in a stretch of 1687 basepairs that was sequenced, the BALB/c 
sfjuence differed from the 129 sequence by 9 basepair substitutions, three small deletions 
(#jl,4 and 6 nucleotides) and two polymorphic CA-repeats (see Proc. Natl. Acad. ScL 
LfM, Vol. 89, pp. 5128-5132, page 5130, Fig. 4). On introduction of these constructs in 
l2jp-derived ES ceils, homologous recombination at Rb with the 129-derived construct was 
50J-fold more efficient than with the nonisogenic BALB/c-de rived construct. To provide 
^Uditional evidence that the suppression of recombination was solely dependent on the 
polymorphisms between the endogenous locus and the targeting DNA, the inverse 
experiment was performed, i.e. targeting of a BALB/c-derived ES cell line with the 129- and 
pALB/c-derived constructs. This experiment yielded the inverse result, i.e. a higher 
isrgeting efficiency with the BALB/c-derived construct than with the nonisogenic 129- 
derived construct; 

With a somewhat different targeting construct, consisting of a hygromycin resistance gene 
embedded in 17 kb of isogenic Rb DNA, we observed that 80% of all Hygromycin B- 
resistant colonies resulted from homologous recombination (5). This demonstrates that, in 
the presence of perfect homology, also in mammalian cells homologous recombination rather 
than random integration can be the predominant event. 

Although clearly not all problems of gene targeting have been solved, many genes have 
now been successfully targeted by the use of isogenic DNA targeting constructs. However, 
genetic modification of cells derved from an outbred organism can become a difficult 
endeavour as isogenic targeting constructs are not easily available. In this case, efficient 
gene targeting would require the targeting construct to be prepared from DNA derived from 
the target cell. Especially in the context of gene therapy, this v/ould raise a tremendous 
practical obstacle to correction of a defective gene. Also, base sequence divergence 
■mooses a major barrier to exchanging a large chromosomal region of one species by the 
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these problems. 



The introduction of subtle mutations 

Although protocols for disruption of genes in inbred ES cell lines (and in somatic cell lines of which 
isogenic DNA targeting constructs can be prepared) are rather well developed, the introduction of 
more subtle mutations is not straightforward. Current protocols are variations on a two-step 
procedure in which first a marker gene is introduced into the target gene followed by replacement of 
the marker gene by the desired subtle mutation (5). This procedure requires the marker gene to be 
selectable both for its presence (first step) and its absence (replacement by the subtle mutation). 

Useful marker genes are the Hprf minigene to be used in Hp/t-deficient ES cells (positive selection 
in HAT medium; negative selection by 6-thioguanine) and a combination of the neomycin resistance 
gene (positive selection by G418) and the Herpes Simplex Virus thymidine kinase gene (negative 
selection by Gancyclovir). In an alternative procedure, the subtle mutation and the marker gene are 
present on the same targeting construct and concomittantiy introduced into the genome by 
homologous recombination, if an insertion-type vector wasused, the marker gene can be removed 
by intrachromosomal recombination between the duplicated sequences that were generated during 
the first integration event (23). In case of a replacement-type vector, the marker gene can be 
removed if it was flanked by two site-specific-recombination sites (e.g. loxP sites). Recombination 
between these sites on introduction into the ceil of the /oxP-specific recombinase Cre will remove 
the marker gene from the genome (24). 

Although either of the above mentioned procedures has allowed the subtle modification of a number 
of genes in ES cells, they are highly demanding as to the generation of appropiate DNA targeting 
constructs and the cuituring of ES cells under various selective conditions. 

Therefore, an attractive alternative to these procedures might be the use of small single- or double- 
stranded oligonucleotides (up to 100 bases or basepairs), which are identical to the target locus 
except for one or several basepair alterations. However, our finding that base sequence 
dissimilarities as small as 0.6% strongly suppress homologous recombination, may impose a major 
impediment to using such oligonucleotides for the introduction of subtle genetic modifications. The 
present invention provides a way to overcome this problem and may allow the subtle modification 
of cell lines and cells derived from living organisms and temporarily cultured in vitro. 
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GENETIC MODIFICATION OF EUKARYOT1C CELLS USING DNA CONSTRUCTS 
WITH SUBSTANTIAL BASE SEQUENCE DIFFERENCES WITH RESPECT TO THE 
TARGET LOCUS. 

DESCRIPTION OF THE INVENTION 

The manuscript included as annex in this application provides a detailed description of 
the methodology used in this application and will be published in the journal Cell on July 
28, 1995. 

As described above, base sequence dissimilarities as modest as 0.6% impose a strong barrier to 
efficient homologous recombination in mouse embryonic stem cells (5). Suppression of homologous 
recombination by small base sequence divergencies was earlier observed in bacteria (£ coli. S. 
typhimurium and S. pneumoniae), yeast and mouse fibroblasts (2-4). The role of the DNA mismatch 
correction system in suppressing recombination between homologous but diverged sequences, 
was most dramatically demonstrated by Radman and coworkers in studying bacterial conjugation 
between the related but diverged species Escherichia coli and Salmonella typhimurium (6). This 
process relies on entry of chromosomal fragments of one species into the other and recombination of 
these fragments with the chomosome of the recipient bacterium. The sequence divergence between 
the two species is estimated to be 20-30% and therefore the recovery of exconjugants from an 
interspecies cross is about 2x1 0 5 fold lower than from an intraspecies cross. However, the 
recovery of exconjugants from the interspecies cross increased 3x1 0 3 fold if the recipient bacteria 
carried an inactivating mutation in either the mutSor mutL gene, both of which being essential for 
DNA mismatch correction. 

The central protein in DNA mismatch correction in E colils encoded by the mutS gene (25). It 
recognizes and binds to base mispairs and small loops of up to four unpaired nucleotides. After 
binding to heterodupiex DNA, the MutS-DNA complex is bound by the mutL gene product which 
leads to excision of a tract of single-stranded DNA of up to several kilobases that contains the 
mispaired nucleotide(s). In this process also the mutUgene product plays a role, whereas the mutH 
gene product ensures removal of newly synthesized strands rather than parental strands. The 
repair process is completed by resynthesis of the excised strand and ligation of the remaining nick. 
Mismatch repair in £ coli is responsible for maintaining genome stability in at least two ways: i) by 
recognizing and repairing mis- and unpaired nucleotides that occur, respectively, by 
misincorporation and slippage during DNA replication; ii) by recognizing mismatches occurring in 
heterodupiexes formed at initial stages of recombination between homologous but not-identicai 
sequences. This may either lead to blocking elongation of heterodupiex formation or dissociation cf 
the heterodupiex thus aborting the recombination reaction. Consequently, £ coli strains defective 
br either mutS or mutL, have a pleiotropic phenotype; an increased mutation rate, including 
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destafailization of simple-sequence repeats and an increased rate of recombination between 
homologous but diverged DNA sequences. The latter phenotype is clearly manifested by the 
efficient recovery of recombinant bacteria resulting from conjugationa! crosses between the related 
but diverged species Escherichia coliand Salmonella typhimurium wherein the recipient bacterium 
was deficient for mutS or mutL (6), Also, the frequency of chromosomal rearrangements by ectopic 
recombination between diverged sequences is substantially elevated in mismatch repair deficient 
bacteria (26). 

In many respects, the biochemistry of mismatch repair systems in eukaryotes resembles 
that of the E. coli mutS,L system. Homologs of both genes have been identified in yeast 
and mammalian cells. Based on mismatch binding in vitro, and on the mutator and 
recombinator phenotypes of Saccharomyces cerevisiae mutants, the protein encoded by 
the yeast MSH2 gene seems to be the functional homolog of MutS (27-29). A homolog of 
the yeast MSH2 gene was identified in mammalian ceils by analysis of a GT-mismatch- 
binding activity, positional cloning and PGR amplification of mouse DNA using degenerate 
primers (30). Similarly, homologs of the E. coli mutL gene were identified in yeast and 
mammalian cells. 

Interestingly, inherited mutations in human mutS and mutL homologs were recently found to 
be related to the cancer predisposition syndrome HNPCC (hereditary nonpolyposis 
colorectal cancer), which is characterized by development of tumors of the proximal colon at 
early age. In these tumors, mismatch repair is lost, as manifested by destabilization of 
simple sequence repeats, the replication error-positive (RER+) phenotype (12). 

In order to investigate the role of the mammalian Msh2ger\e in DNA mismatch repair and to 
address the role of mismatch repair in maintaining genome stability, we generated an ES cell 
line carrying a disruption in both copies of the mouse MSH2 gene. This line is disignated 
dMsh2-9. Its construction is described in the accompanying manuscript: 
"Inactivatlbn of the mouse Msh2 gene results in mismatch repair deficiency, 
methylation tolerance, hyperrecombination, and predisposition to cancer", by 
Niels de Wind, Marleen Dekker, Anton Berns, Mirosiav Radman, and Hein te Riele, 
to be published in Cell, the 28th July, 1995 on pages 6, 14, 27 (legend to Fig.1) and 30 
(Fig. 1)- 

As demonstrated in the manuscript, the phenotypic consequences of Msh2 deficiency in 
mouse embryonic stem ceils provide clear evidence for an essential role of MSH2 in 
mammalian DNA mismatch repair, First, Msh2-defictent ES cells lack binding activity to a 
double stranded 38-mer oligonucleotide carrying a G*T mismatch or an unpaired TG 
dinucleotide [pages 6, 15, 27 (legend to Fig.2) and 30 (Fig.2)]. Second, Msn2-dencient ES 
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ceHs have a mutator phenotype as evidenced by an at least 150-fold increase in the 
number of cells resistant to 6-thioguanine, indicating mutational inactivation of the X-Iinked 
Hprt gene. Moreover, microsatellite length instability was observed in subclones derived 
from the Ms/?2-deficient ES cell line, but not in subclones derived from wild-type ES cells 
[pages 7, 15, 27 (legend to Fig.3) and 30 (Fig.3)]. Third, Ms/i2-deficient ES cells resisted a 
20-fold higher concentration of the methylating agent N-Methyl-N-Nitro-N-Nitrosoguanidine 
than wild-type ES ceils [pages 7/8, 16, 27 (legend to Fig.4) and 31 (Fig. 4)]. Fourth, mice 
bred to homozygosity for the disrupted Msh2 allele, originally generated in ES cells, were 
highly predisposed to tumorigenesis [pages 9, 14/15, and 29 (Table 2)]. 
Thus, disruption of Msh2 leads to inactivation of mammalian DNA mismatch repair. 

By using a gene targeting assay, we demonstrated that it is the mammalian DNA mismatch 
repair system which is responsible for suppressing homologous recombination between 
sequences differing as little as 0.6% at the nucleotide level. As described above, 
homologous recombination at the Rb Iccus with an isogenic DNA targeting construct was 
50-fold more efficient than with a similar, but nonisogenic construct containing on the 
average 0.6% sequence divergence with respect to the target locus (5). However, in 
jWs/^-deficient ES cells, homologous recombination at Rb with the nonisogenic targeting 
construct was as efficient as with the isogenic construct [see the accompanying manuscript: 
"Inactivation of the mouse Msh2 gene results in mismatch repair deficiency, 
methylation tolerance, hyperrecombination, and predisposition to cancer", by 
Niels.de Wind, Marleen Dekker, Anton Berns, Mirosiav Radman, and Hein te Riele (pages 
8/9,16, 28 (Table 1)]. 

This experiment demonstrates that it is the mammalian DNA mismatch repair system which 
is involved in preventing recombination between homologous but diverged DNA 
sequences. 

This finding provides a method for modifying the mammalian genome but also the genome of 
any other eukaryotic organism via homologous recombination using DNA targeting 
sequences that substantially differ from the target locus with respect to the nucleotide 
sequence, by genetic or functional inactivation of the cell's mismatch repair system. 
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This manuscript is pan of the application and contains a detailed description 
of the methodology as described in the application. This manuscript will be 
published in the journal Cell on July 28, 1995, 
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•Summary 



To investigate the role of the presumed DNA mismatch repair (MMR) gene 
Msh2 in genome stability and tumorigenesis, we have generated cells and 
mice that are deficient for the gene. A/s/i2-deficient cells have lost mismatch 
binding, and have acquired microsatellite instability, a mutator phenotype 
and tolerance to methylating agents. Moreover, in these cells, homologous 
recombination has lost dependence on complete identity between interacting 
DNA sequences, suggesting that Msh2 is involved in safeguarding the 
genome from promiscuous recombination. iV/s/i2-deficient mice display no 
major abnormalities, but a significant fraction develops lymphomas at early 
age. Thus, Msh2 is involved in MMR, controlling several aspects of genome 
stability; loss of MMR-controlIed genome stability predisposes to cancer. 
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Introduction 

Cancer is the endpoint of an evolutionary process in which normal cells acquire full 
malignancy by cumulative genetic or epi-genetic alterations, each conferring proliferative, 
invasive or metastatic potential to the cell. In colorectal cancer, at least six independent 
transforming mutations in oncogenes or tumor suppressor genes are required for a cell to 
develop into a fully malignant tumor (reviewed by Vogelstein and Kinzler, 1993). This 
evolutionary process might be dramatically accelerated by the early loss of systems that 
safeguard genome stability, resulting in an increased rate of mutagenesis and chromosomal 
rearrangements (Hartwell, 1992 and Loeb, 1994). In support of this hypothesis, a number of 
hereditary cancer predisposition syndromes are attributed to defects in DNA repair. Two of 
these are Hereditary Nonpolyposis Colorectal Cancer (HNPCC) and Muir-Torre syndrome, 
which are characterized by the development of tumors of the proximal colon at early age. 
Also cancers of other parts of the gastrointestinal tract and of the genitourinary tract are often 
found (Lynch et al., 1993). Muir-Toire patients are, in addition, prone to cancers of the 
sebaceous glands and the skin (Cohen et al., 1991). Individuals affected by one of these 
syndromes have most likely inherited a mutant allele of either one of four mismatch-repair 
(MMR)-related genes called MSH2 , MLH1. PMS1 or PMS2 (Leach et al., 1993; Bronner 
et al., 1994; Kolodner et al., 1994; Liu et al., 1994, Nicolaides et al, 1994; Papadopoulos et- 
al., 1994; Kolodner et ah, 1995; reviewed by Jiricny, 1994 and by Modrich, 1994). In 
tumors of these patients, MMR is lost , presumably through inactivation or loss of the wild- 
type allele (Leach et al., 1993; Nicolaides et al., 1994; Papadopoulos et aL, 1994; Hemminki 
et al. 1994). The mechanisms of loss of MMR, the cause of the apparent tissue specificity of 
tumorigenesis and the role of loss of MMR in the evolution of a normal cell to fully malignant 
tumors are still obscure. Moreover, recent data suggest that MMR is frequently lost in many 
types of sporadic tumors (Han et al., 1993; Parsons et aL, 1993; Risinger et al., 1994; 
Chong et al., 1994; Mironov et al., 1994; Orth et aL, 1994; Shibata et al., 1994; Suzuki et 
aL, 1994; Umar et aL, 1994b; Wada et aL, 1994; Wooster et aL, 1994; Liu et aL, 1995). 
This supports the notion that loss of MMR may play a general role in tumor evolution. 
The best studied MMR system with respect to both genetics and biochemistry is the mutSL 
pathway of Escherichia coli (reviewed by Modrich, 1991). Central in this pathway is the 
protein encoded by the mutS gene, which binds to base mispairs and to loops of up to four 
unpaired nucleotides. After binding to the heteroduplex, the MutS-DNA complex is bound by 
the mutL gene product, triggering excision of a tract of single-stranded DNA of up to several 
kilobases that contains the mismatched nucleotide(s). The repair process is completed by 
resynthesis of the excised DNA strand and ligation of the remaining nick, MMR in £. coli is 
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responsible for maintaining genome integrity in at least two ways: (i) by recognizing and re- 
pairing mispaired and extrahelical nucleotides that occur by, respectively, misincorporation 
and slippage during replication and (ii) by recognizing mismatches occurring in 
recombination intermediates between homologous but diverged sequences, leading to 
abortion of recombination. Consequently, £. coli strains deficient for either mutS or mutL 
have a pleiotropic phenotype: an increased mutation rate, including destabilization of simple- 
sequence repeats (also called microsatellites; the 'Replication Error + \ or RER+ phenotype) 
and promiscuity in inter- and intragenomic recombination (the recombinator phenotype, 
Rayssiguier et ah, 1989; Petit et ah, 1991). 

In many respects the biochemistry of MMR systems in eukaryotes resembles that of the 
mutSL system (Holmes et ah, 1990; Variet et ah, 1990; Thomas et al., 1991; Fang and 
Modrich, 1993). Homologs of both mutS and mutL were found in eukaryotes. Based on 
mismatch binding in vitro and on the mutator and recombinator phenotypes of 
Saccharomyces cerevisiae mutants, the protein encoded by the yeast MSH2 gene (Reenan 
and Kolodner, 1992a) seems to be the functional homolog of MutS (Reenan and Kolodner, 
1992b; Miret et al, 1993; Alani et al, 1995). In higher eukaryotes (Xenopus laevis, mouse 
and human) homologs of the yeast MSH2 gene have recently been identified, based on either 
sequence homology, direct sequencing of the major mismatch-binding protein or by 
positional cloning (Fishei et ah, 1993; Leach et al., 1993; Palombo et al., 1994; Variet et al./ 
1994). An apparent difference between MutS and MSH2, is the binding of the eukaryotic 
protein, when overproduced and purified from E. coli or yeast, to stretches of up to 14 
extrahelical nucleotides (so called insertion-deletion-type loops, or IDLs) in vitro (Fishei et 
al., 1994b; Alani et al M 1995), suggestive of a role for Msh2 in the repair of these loops 
(Umax et al, 1994). Eukaryotic homologs of the mutL gene, called MLHL PMS1 and PMS2 
have been analyzed in yeast (Kramer et al M 1989a;. Kramer et al., 1989b; Strand et ah, 1993; 
Prolla et al., 1994a; Prolla et al., 1994b) and human cells (Li and Modrich. 1995). 
The precise role of these mutS and mutL homologs in MMR and the role of MMR in the 
maintenance of genome stability in mammalian cells remain to be established. In addition, the 
causative role of MMR in cellular sensitivity to simple methylaung agents, as was suggested 
by the loss of MMR after selection of cells for tolerance to these agents (Branch et al., 1993; 
Kat et al., 1993) and by the methylation tolerance of a MMR-denciem tumor cell line (Koi et 
al M 1994) remains to be explored. 

To investigate the involvement of Msh2 in mammalian MMR and the roles of MMR in 
maintaining genome stability we have generated mouse Embryonic Stem (ES) cell iines 
which lack both copies of the Msh2 gene. We show that Ms7:2-derlcient cells lack binding to 
some specific mismatches, have acquired a mutator phenotype and are tolerant to simple 
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methylating agents. Moreover, these ceils have lost the barrier to recombination between 
homologous but diverged sequences. Taken together, these results suggest that MMR is 
essenual for the preservadon of multiple aspects of genome stability in mammalian cells. 
To study the effect of MMR deficiency on the edology and evolution of cancer we generated 
mice lacking one or both copies of the Msh2 gene in part or all of their cells. Although 
heterozygous mice, a model for HNPCC, remain healthy to at least 8 months of age, a 
significant fraction of both chimeric and fully homozygous Mshl knock-out mice succumbs 
to lymphomas. These results provide evidence that loss of MMR-controlled genome stability 
can accelerate the development of cancer. 
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-Results 

Targeted disruption of the Msh2 gene in mouse ES cells 

Heterozygous ES cell lines sMsh2-42 and $Msh2-55 7 carrying a hygromycin-resistance gene 
inserted between codons 588 and 589 of one allele of the Msh2 gene, were generated by gene 
targeting using the construct shown in Fig. 1 A. Cell line dMsh2-9> carrying the hyg marker 
in both alleles of the Msh2 gene, was obtained by selecting cell line $Msh2-55 for duplication 
of the chromosome carrying the targeted Msh2 allele at an elevated hygromycin B 
concentration, a method that was previously shown to be effective using the neomycin- 
resistance marker (Mortensen et al, 1992). Targeting events were detected by Southern blot 
analysis, using probes flanking both sides of the targeting construct (Fig, 1A). An example 
of such a Southern blot, demonstrating the status of the Msh2 alleles in cell lines wt-2 (a 
control cell line with the targeting construct integrated by non-homologous recombination), 
sMsh2-55 and dMsh2-9 is shown in Fig. IB. Diploidy of cell lines sMsh2-55 and dMsh2-9 
was verified by karyotyping (not shown). 

The hyg marker was inserted, in the antisense orientation, N-terminally of the putative ATP 
binding site (Fig. 1 A) in a region where (in the human gene) truncations were found in many 
HNPCC tumors (Liu et ah, 1994). Moreover, no Msh2 transcript could be detected in ES 
cell line dMsh2-9 by Northern blotting (not shown), indicating that the hyg marker strongly- 
suppressed Msh2 expression. The genotype of ES cell line &Nfsh2-9 will therefore be 
indicated by Msh2-I-. 

Mismatch binding in Msh2-f- cells 

Two different mismatch-binding activities have been described in mammalian cell extracts 
using gel-retardation assays: (i) An activity that recognises GT mismatches (Jiricny et al., 
1988; Stephenson and Karran, 1989; Griffin and Karran, 1993), an extrahelical TG 
dinucleotide (Aquilina et al; 1994) representing an intermediate in replicational slippage of a 
microsatellite and, weakly, G.A, G.G, A.C and G.U mismatches (Stephenson and Karran, 
1989; Hughes and Jiricny, 1992). (ii) In one cell line, an independent activity was detected 
that recognizes A.C mismatches and also pyrimidine-pyrimidine mismatches (Stephenson 
and Karran, 1989). Based on protein sequencing, the purified GT-binding activity was 
shown to contain the MSH2 protein (Palombo et al„ 1994). To characterize the mismatch- 
binding pronerties of the Msh2-f- ES cell line, we have performed gel-retardation assays 
using cell extracts of wild-type ES cell line wt-2 and A/j/z2-deficient line cLV/s/22-9. Extracts 
were incubated with radiolabeled 38-mer double-stranded oligonucleotides containing either 
a mismatch, an extrahelical dinucleotide, or a 14-nucleotide insertion-deietion-type loop 
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.(IDL), followed by electrophoresis on a nondenaturing polyacrylamide gel. Results of these 
experiments as shown in Fig. 2 clearly demonstrate binding activity in wild-type cell extracts 
to a G.T mismatch and to an extrahelical TG dinucleotide. Binding to both oligonucleotides 
was entirely absent in the extract of the Msh2-I- ES cell line, demonstrating the involvement 
of the Msh2 protein in this binding activity. As shown before (Hughes and Jiricny, 1992), 
binding was abolished by the inclusion of ATP in the binding reaction (not shown). We were 
unable to detect specific binding to G.A t G.G, or A.C mismatches under the conditions used 
in wild-type or mutant cell extracts. In addition, no Msh2-dependent binding to a 14- 
nucleotide IDL was observed (Fig. 2). 

Msh2-f~ cells have a mutator phenotype 

The Msh2-/- cell line was indistinguishable from the wild-type cell line wt-2 and the parental 
heterozygous cell line sMsh2-55 with respect to growth and plating efficiency (not shown). 
To address the role of Msh2 in MMR t we determined the stability of two microsatellite 
markers in Msh2-f- cell line dAlsh2-9 and Msh2+f+ line wt-2. For this purpose, 24 
subclones were derived of each cell line, which had undergone approximately 20 cell 
divisions since its generation. Chromosomal DNA, isolated from each expanded subclone 
was subjected to the polymerase chain reaction (PCR) using two end-labelled primer pairs 
\D7Mitl7 and D14Mitl5\ Dietrich et al., 1994). Whereas none of the wt-2 sublines showed* 
any alteration in microsatellite length, clear length alteradons in many dMy/z2-9 sublines were 
seen for both microsatellites (8 out of 24 for marker D7MU17 and 6 out of 24 for marker 
D14Mitl5, Fig.3). This observation strongly suggests that the Msh2 protein is involved in 
the repair of slipped replication intermediates. The rnicrosatellite-slippage rate was estimated 
to be 10" 2 to lO 3 per generation. 

We subsequently investigated the effect of loss of the Msh2 gene on the mutation frequency 
of a functional gene. To this purpose, 6xl0 6 cells of cell lines wt-2 and dMs/z2-9 were plated 
in the presence of 6-thioguanine (6-TG) to select for cells that have lost activity of the X- 
linked Hprt gene by mutation. Whereas no 6-TG-resistant colonies were seen in the wt-2 
culture; 188 resistant colonies were present in the &Msh2-9 culture. This result extends the 
mutator phenotype of Msh2-/- cells to functional genes. 

Msh2-I- cells are tolerant to N-methyl-N f -nitro-N-nitrosoguanidine (MNNG) 

MMR was suggested to mediate hypersensitivity to agents that methyiate G residues at the O 5 
Dosicion, It is believed that MMR recognises the O^-meG.T mispair that occurs after 
erroneous incorporation of a thymidine nucleotide opposite to 0 6 -meG during replication 
(Griffm et aL, 1994). This is supposedly followed by excision and repair synthesis, again 
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-incorporating thymidine opposite to 0 6 -meG, triggering a new round of MMR. The net result 
of this cycling between excision and resynthesis will be the presence of single-stranded 
regions at the site of 0 6 -meG residues which will lead to double-stranded gaps when 
replicated during S phase, resulting in cell death (for reviews, see Karran and Bignami, 
1992; Karran and Bignami, 1994). It is predicted that loss of MMR will prevent this process, 
thus conferring tolerance to simple methylating agents. To test this hypothesis directly, 
survival of Msh2-I- cell line dM.yA2-9, Msh+I- cell line sMsh2-2\ and wild-type cell line wt-2 
was determined after exposure to a range of concentrations of N-methyl-N'-nitro-N- 
nitrosoguanidine (MNNG). In the growth medium, 0 6 -benzylguanine was included to 
competitively inhibit endogenous methyltransferase activity that might otherwise remove the 
methyl groups added by MNNG (Dolan et al., 1990). Fig. 4 shows that the LD50 for 
MNNG was increased 20-fold in the Msh2-I- cell line compared to the heterozygous and 
wild-type cell lines, directly proving the involvement of Msh2 in determining cellular 
sensitivity to MNNG. Heterozygous cell line sMsh2-2l displayed no increased tolerance to 
MNNG. 

Loss of suppression of recombination between diverged sequences in 
Msh2-/- cells 

In E. coli (Shen and Huang, 1986), Drosophila (Nassif and Engels, 1993) and mammalian 
cells (Waldman and Liskay, 1988, Te Riele et al, 1992), the efficiency of recombination 
between homologous DNA stretches is highly dependent on their sequence identity. We have 
previously demonstrated that in a gene targeting assay, homologous recombination at the 
Retinoblastoma (Rb) locus in ES cells, derived from mouse strain 129, was 50-fold more 
efficient with a 129-derived targeting construct than with a construct derived from a non- 
isogenic (BALB/c) mouse strain (Te Riele et al, 1992; Table 1). This construct contained 
0.6% base sequence divergence with respect to the isogenic 129 construct. To provide 
additional evidence that suppression of recombination was solely dependent on the 
polymorphisms between the endogenous locus and the targeting DNA, we performed the 
inverse experiment, i.e. targeting of a BALB/c-derived ES cell line with the 129- and 
B ALB/c-derived constructs. This experiment yielded the inverse result, i.e. a higher targeting 
efficiency with the BALB/c targeting construct than with the non-isogenic 129 targeting 
construct (Table 1). To investigate whether MMR is responsible for this antirecombination 
effect, we have repeated the Rb targeting experiment in the (129-derived) ES cell line ±Msh2~ 
9. We found that in the Msh2-/~ cell line homologous recombination at the Rb locus with the 
non-isogenic construct was as efficient as with the isogenic construct (Table 1). This 



18 

SUBSTITUTE SHEET (RULE 25) 



WO 91/05168 



PCT/EP95/02980 



experiment demonstrates that Msh2 is involved in preventing homologous recombination 
between diverged DNA sequences, 

Msh2-deficient mice are viable but predisposed to tumorigenesis 
To study the effect of MMR deficiency on the development of cancer, Msh2+/- ES cell lines 
sMsh2-55 and sMsh2 -42 and Msh2-/- ES cell line duVlsh2-9 were used to generate chimeric 
mice. Both the Msh2 single and double knock-out ES cells gave rise to healthy animals with 
coat-color chimerism of 20 to 70 %. This result suggests that MMR deficiency did not alter 
the capacity of the ES cells to compete with wild-type cells in chimeric development. 
Msh2+/~ chimeras were found to transmit the mutant allele through the germ line resulting in 
Msh2 heterozygous Fl mice. These mice, which may serve as a model for HNPCC, were 
healthy up to at least 8 months of age (Table 2). 

Intercrossing of Msh2+f- mice resulted in Msh2-/~ mice according to a normal Mendelian 
distribution; they were healthy at birth and fertile. However, 30% of the Msh2-f- mice 
developed metastasizing lymphomas of T-celi origin with a peak at 2 months of age (Table 
2). In addition, one mouse suffered from generalized histiocytic sarcoma at 3.5 months of 
age. Extensive histological analyses of these mice did not reveal any other abnormality. Also 
healthy Msh2-I- mice sacrificed at 2 months of age were histologically normal and had 
normal B- and T-cell populations as evidenced by FACS analysis. 
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Discussion 

Lack of mismatch binding in Msh2-I- cell extracts 

Whereas wild-type cell extracts display efficient binding to both a G.T mismatch as well as to 
an extrahelical TG dinucleotide. extracts of the ES cell line 6N!sh2-9 lack this binding (Fi<*. 
2), suggesting that the Msh2 protein is essential for binding to these heteroduplexes. We 
were unable to detect specific binding activity to other mismatches under the conditions used. 
This result is compatible with binding studies by others which demonstrate that binding to a 
G-T mismatch is stronger than to other mismatches (Stephenson and Karran, 1989; Hughes 
and Jiricny. 1992) whereas the independent A.C binding activity is apparently cell-type 
specific (Jiricny et aL, 1988; Stephenson and Karran, 1989). The methylation-tolerant cell 
lines 'clone B* and RajiF12 were also found to lack bindingjo both a G.T mismatch (Branch 
et ah, 1993; Griffin ec aL, 1994) and to an extrahelical TG dinucleotide (Aquiiina et aL, 
1994). Binding to an A.C mismatch was unaffected (Branch et aL, 1993). Therefore, these, 
cell lines may have lost the Msh2 gene. Purified human MSH2, overproduced in £. coli % and 
purified yeast MSH2 were shown to bind to G.T mismatches (Fishel et al. f 1994a; Alani et 
aL, 1995), but also strongly to IDLs of up to 14 nucleotides (Fishel et ai., 1994b; Alani et 
aL, 1995). Surprizingly, we could not detect Msh2-dependent binding to a 14 nucleotide IDL 
in ES cell extracts. We are unable to explain this apparent discrepancy. In contrast to the - 
binding that we observed in ES cell extracts and that of the GTBP protein, the MSH2- 
containing G.T mismatch-binding protein fraction purified from HeLa cells (Hughes and 
Jiricny, 1992; Palombo etal., 1994), the binding capacity of the purified MSH2 proteins was 
not abolished by ATP (Fishel et aL, 1994b; Alani et aL, 1995). This suggests that the 
purified MSH2 proteins behave differently from the naturally produced protein. The finding 
that in Msh2~l- cells length shifts in microsatellites were never larger than 4 nucleotides 
(representing two CA dinucleotides; Fig. 3), suggests that binding activity of Msh2 to 
extrahelical loops longer than 4 nucleotides is not required to maintain microsatellite stability. 
We are currently initiating in vitro repair assays using wild-type and Msh2-I- cell extracts to 
further elucidate this point. 

Mutator phenotype of Ms/z2-deficient cells 

Cell line dMsh2~9 had acquired microsatellite instability (the RER~*" phenotype), indicating 
failure to repair slipped replication intermediates. The frequency of microsatellite instability 
was similar to that of some RER+ tumor cell lines (Parsons et aL, 1993; Bhattacharyya et ai., 
1994). The mutator phenotype of these cells also affects functional genes as evidenced by the 
race of Hprt deficiency in a cLV/5/z2-9 culture which was increased from undetectable (i.e. 
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- lower than 1.6xl0" 7 ) for wild-type cells to 3xl0" 5 . This frequency is in agreement with that 
found in some RER+ tumor cell lines (Bhattacharyya et al., 1994; Shibata et aL 1994; 
Eshleman et al. f 1995), although cell lines that have been selected for MMR deficiency using 
a methylating agent tend to have somewhat lower mutation frequencies (Goldmacher et aL 
1986; Branch et aL, 1993; Kat et al., 1993; Aquilina et aL 1994). Growth rate and plating 
efficiency of A/j/z2-deficient, heterozygous and wild-type cells were indistinguishable, 
suggesting that the mutator phenotype does not interfere with viability of the cells. This is 
remarkable, since both were seriously affected in some cell lines that were selected for MMR 
deficiency using methylating agents (Goldmacher et aL 1986). This suggests that in the latter 
cells, additional events may have taken place that interfere with cell viability. Based on the 
mutator phenotype of dMj/z2-9 cells and on the binding studies described above, we 
conclude that Msh2 is involved in, and essential for, postreplicational MMR. 

Sensitivity to simple methylating agents is mediated by MMR 
Sensitivity of cells to agents that methylate DNA at the O 6 position of G residues has been, 
attributed to futile attempts of MMR to correct the resulting 0 6 -meG.T basepairs (see 
results). This hypothesis is supported by two observations: (i) Prolonged selection of cells 
for tolerance to Methylnitrosourea or MNNG frequently results in loss of MMR (Kat et ai., 

1993) and loss of G.T-binding activity (Branch et al., 1993; Griffin et aL 1994). (ii) Ah 
RER + tumor cell line was found to be tolerant to these agents and sensitivity was restored by 
introduction of the chromosome containing a wild-type allele of the defective mismatch- repair 
gene (Koi et aL, 1994). Our finding that /V/5/z2-deficient cell line cLV/jA2-9 had acquired 
tolerance to MNNG provides direct evidence in support of this hypothesis. This finding may 
have important clinical implications. The triazene class of carcinostatic drugs probably 
derives its cytotoxic effect from methylation at the O 6 position of G residues (Michejda et al., 

1994) . Whereas MMR-deficient tumors are supposedly tolerant to these drugs, they may 
elicit severe cytotoxity in normal tissues of the patient. In addition, the persistence of O^- 
meG.T mispairs in MMR-deficient tumor cells might lead to increased G to A transitions 
(Aquilina et al., 1993), resulting in acceleration of tumor progression. Finally, the finding 
that simple methylating agents select, or induce, MMR deficiency in cells has led to the 
hypothesis that this process is involved in determining the tisssue specificity of tumorigenesis 
in HNPCC patients, due to the presence of such agents in the intestine (Karran and Bignami, 
1994; Varletet al., 1994). 
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jVIMR edits homologous recombination in mammalian cells 
Previously, it has been shown that the presence of less than 1% nucleotide difference can 
strongly suppress homologous recombination in mammalian cells (Waldman and Liskay, 
1988, Te Riele et al., 1992). Here, we have extended these results and we have demonstrated 
that Msh2-I- cells have lost this heterology-dependent suppression of recombination. Based 
on results obtained in £. colt (Worth et al., 1994), this function is probably mediated by the 
Msh2 protein, by recognizing mismatches that occur during heteroduplex formation between 
two not perfectly complementary DNA strands. This may directly destabilize the 
heteroduplex or, alternatively, prevent elongation of strand exchange. Similar results have 
recently been obtained with A/Si/2-deficient yeast strains (Alani et aL, 1994; Selva et al., 
1995), suggesting that this role of MMR is strongly preserved during evolution. In £. coli, 
loss of MMR has been shown to result in a high frequency of inter- and intrachromosomal 
rearrangements due to ectopic recombination between diverged sequences (Rayssiguier et aL, 
1989; Petit et al., 1991). More than the £. coli genome, mammalian genomes contain many 
repeated but diverged sequences. Examples of these are gene families, SINE sequences, 

r 

LINE sequences and pseudogenes (Brosius and Gould, 1992; Nowak, 1994). Ectopic 
recombination between these sequences would lead to genomic rearrangements that may 
either be detrimental for ceil viability due to loss of essential genomic information or, 
alternatively, lead to activation of oncogenes or inactivation of tumor suppressor genes* 
(Bouffler et al, 1993). Homologous recombination between identical DNA stretches should 
be allowed to proceed, however, to allow recombinational repair of double strand breaks 
using the (identical) sister chromatid as sequence donor. For this reason, mechanisms 
counteracting homologous recombination when both recombining stretches contain 
divergencies, while permitting recombination to proceed when both stretches are identical, 
may be of great importance for the mammalian cell to enable repair without risking genome 
rearrangment. Based on the effect of Afj/r2-deficiency on gene targeting, it is likely that 
MMR plays this scrutinizing role. 

Our finding that Msh2 acts not only anti-mutagenic but also anti-recombinogenic, suggests 
that MMR deficiency may contribute to tumorigenesis by elevating the rate of both point 
mutations and chromosomal rearrangements. 

Role of Msh2 deficiency in tumorigenesis 

Ms^-deficiency did not interfere with mouse embryonic development and is compatible with 
adult life. This observation substantiates the recent finding that some HNPCC patients seem 
largely MMR deficient in normal ceils (Parsons et al., 1995). A significant fraction of 
chimeric and pure Msh2-f- mice developed metastasizing lymphomas. It is likely that the 
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•developing immune system in newborn mice, in which a high turnover of maturing T cells 
exists, offers a window where MMR deficiency can strongly accelerate the accumulation of 
transforming events. 

The basal level of mutagenesis in the cell, estimated to be approximately 10" 10 per nucleotide 
per generation, is considered to be insufficient to sustain the multistep clonal outgrowth of 
cells in tumorigenesis (Loeb, 1994). Therefore an increased mutation rate, such as caused by 
MMR deficiency, is likely to be obligatory to accelerate this process (Loeb, 1994). Our 
results suggest that a rapid cell expansion is required to unveil the effect of MMR deficiency 
on tumor development. Corroborating this notion, recent studies of microsatellite instability 
during cancer development show that loss of MMR probably does not occur as the initial 
step, but rather within an early tumor stage such as an adenoma (Young et aU 1993; 
Aaltonen et al M 1994; Chong et al. t 1994; Shibata et al., 1994; Wada et al., 1994), which, in 
HNPCC patients, have been found to evolve rapidly into carcinomas (Lynch et aL, 1993; 
Jass et al., 1994). Moreover, two recent studies support the notion that enhanced 
mutagenesis due to MMR deficiency is responsible for tumor progression: (i): Some RER 4 * 
tumors had acquired frameshift mutations in the [^-Microglobulin gene within a CT repeat, 
indicative of unrepaired replication slippage, and putatively leading to escape from immune 
surveillance (Bicknell et aL, 1994); (ii): in some RER + colon carcinomas from HNPCC 
patients, p53 and APC were found to contain either frameshifts in repeated nucleotides or G- 
to A transitions, indicative of the failure to repair slipped simple sequence repeats and G.T 
mismatches, respectively (Lazar et al., 1994). 

In conclusion, our studies on Msh2 deficiency prove the involvement of Msh2 in MMR and 
show that MMR functions as a mechanism safeguarding the mammalian genome against 
mutation by misincorporation and recombination between diverged sequences. In addition, 
our results lend support to the hypothesis that MMR deficiency can strongly accelerate 
malignant transformation of rapidly expanding cell populations. 
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- Experimental procedures 
Disruption of Msh2 in mouse ES cells 

Genomic Msh2 fragments were obtained by screening a 1290LA-derived genomic DNA 
library with a murine Msh2 cDNA probe (Varlet et aL, 1994). The targeting construct was 
prepared by subcloning a 12.5 kbp BamU I fragment and inserting a hygromycin resistance 
gene (from PGKhyg, te Riele et al. t 1990) into the unique SnaB I site located within an exon 
sequence of Msh2. Cloning procedures were performed according to Sambrook et aL 
(1989). 

The targeting construct was seperated from vector sequences by gel electrophoresis, purified 
by electro elution and introduced into 1290LA-derived ES cell line E14 by electroporauon as 
described (te Riele et aL, 1992). Electroporated cells were seeded onto gelatin-coated 10-cm 
dishes (10 7 cells per plate) and subjected to hygromycin B selection (150 ug per ml) in BRL- 
conditioned medium (Hooper et aL, 1987) the following day. After 10 days, individual 
hygromycin B-resistant colonies were randomly picked and expanded on mouse embryonic 
fibroblasts feeder layers. DNA was extracted from expanded colonies, digested with EcoR I 
and analyzed by Southern hybridization using probes flanking both sides of the targeting 
construct (Fig. 1A). Two cell lines were obtained out of 135 hygromycin B-resistant 
colonies showing bands diagnostic for correct integration of the hyg marker between codoris 
588 and 589 of one copy of the Msh2 gene. These cell lines were designated sMsh2-42 and 
sMsh2-55. One cell line obtained from this experiment, designated wt-2, carrying a randomly 
integrated hyg gene, was used as an Msh2+f+ control. 

To obtain an ES cell line carrying a disruption in both copies of Msh2 9 10 6 cells of cell line 
sMsh2-55 were plated onto 10-cm plates and cultured in BRL-conditioned medium 
containing 1.0 or 1.5 mg per ml of hygromycin B. After 12 days of of culturing in selective 
medium and 7 days in non-selective medium, 24 colonies were obtained which had survived 
1.5 mg per ml of hygromycin B. DNA was extracted from expanded colonies, digested with 
EcoR I and analyzed by Southern hybridization. One cell line, designated dMsh2-9 9 carried 
two disrupted Msh2 alleles, had lost the wild-type copy and contained the normal number of 
chromosomes. ES cell lines which had survived 1.0 mg per ml of hygromycin B still 
contained one disrupted and one wild-type Msh2 copy. One of these Msh2+I~ lines was 
designated $Msh2-2\ and used for several experiments. 

Generation of Msh2 mutant mice 

Chimeric mice were obtained by injecting 10-15 cells of ES cell lines sMsh2-55, sMsh2-42 
and ±Msh2-9 into C57B1/6 blastocysts. Male chimeras obtained with Mshl* 1 ' ES cells were 
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crossed with wild-type 1290LA and FVB mice and found to transmit the mutated Msh2 allele 
through the germ line. Homozygous Msh2 mutant mice were obtained by intercrossing Fl 
heterozygotes. 

Gel shift assay 

Preparation of cell extracts, annealing of oligonucleotides, binding of cell extracts to duplex 
oligonucleotides containing mismatched or extrahelical nucleotides, and nondenaturing poly- 
acrylamide gelelectrophoresis were performed essentially as described (Stephenson and 
Karran, 1989). However* gelelectrophoresis was performed in TAE buffer rather than in 
TBE buffer. To obtain duplex oligonucleotides, the oligonucleotide U: 5'-GGGAAGCT- 
Cj'CCAGGCCCCAGTGTCAGCCTCCTATGCTC-3' (sequences were derived from 
Aquilina et al., 1994) was radiolabelled and annealed with any of the following unlabelled 
oligonucleotides: L-G.T: 5'-GAGCATAGGAGGCTGACATTGGGGCCTGGCAGCTT- 
CCC-3* (resulting in a G.T mismatch); L-G.A: S'-GAGCATAGGAGGCTGACAATGG- 
GGCCTGGCAGCTTCCC-3' (resulting in a G.A mismatch); L-G.G: S'-GAGCATAGG- 
AGGCTGACAGTGGGGCCTGGCAGCTTCCC-3* .(resulting in a G.G mismatch);' 
L-A.C: 5'-GAGCATAGGAGGCTGACACCGGGGCCTGGCAGCTTCCC-3' (resulting 
in an A.C mismatch); L-TG: 5 r -GAGCATAGGAGGCTGACACTGTGGGGCCTGGCA- 
GCTTCCC-3' (resulting in an extrahelical TG dinucleotide); L-HOM: 5'-G AGC ATAGG A-. . 
GGCTGACACTGGGGCCTGGCAGCTTCCC-3' (resulting in a homoduplex); 
L-LOOP14: S'-GAGCATAGGAGGCTGACACATACGTGAGTACTCTGGGGCCTGG- 
CAGCTTCCC-3 , (resulting in an IDL loop of 14 extrahelical nucleotides). In all assays, a 
twofold excess of unlabelled homoduplex competitor oligonucleotide was included. As a 
positive control, a duplex oligonucleotide containing the binding site for the E2F family of 
transcription factors was used (Beijersbergen et aL, 1995). 

PCR amplification of microsatellites 

Subclones of ES ceil lines &Msh2-9 and wt-2 were generated by seeding cells onto mouse 
embryonic fibroblasts feeder layers at a density of 10 3 ceils per 10 cm 2 . At that time, the ES 
cell lines were in culture for approximately 20 divisions since their generation. Twenty-four 
colonies from each cell line were expanded. Chromosomal DNA was isolated and subjected 
to the polymerase chain reaction using two end-labelled primer pairs {D14MU15 and 
D7Mit!7, Dietrich et aL, 1994). Amplified products were electrophoresed on a denaturing 
polyacrylamide gel. 
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.Mutation frequency 

6xl0 6 cells of ES cell lines dmshl-9 and wt-2 were plated onto 486 cm 2 gelatin-coated 
tissue culture surface in BRL-conditioned medium. After two days, 6-Thioguanine was 
added at a concentration of 10 jig per ml. After two weeks the number of resistant colonies 
was counted. 

Sensitivity to MNNG 

ES cell lines &NIsh2-9, $Msh2-2l and wt-2 were seeded onto MEF feeder layers at a density 
of 10 3 cells per 4 cm 2 - The following day, cells were exposed for one hour to MNNG 
ranging in concentration from zero to 36.45 (iM in serum-free medium. After 4 days of 
incubation, cells were trypsinized, stained with Trypan blue and counted. During the whole 
procedure from one hour before exposure to MNNG, 0 6 -benzylguanine (20 jiM, kindly 
provided by A. Pegg) was included in the medium in order to competiveiy inhibit 
endogenous methyltransferase activity that might otherwise remove the methyl groups added 
by MNNG (Dolan et al, 1990). 

Rb targeting with isogenic and non-lsogenic DNA 

Targeting and subsequent analysis of the Rb locus in a BALB/c-derived ES cell line (kindly 
provided by S. Rastan) and 129/OLA-derived ES cell lines dMsh2-9 were performed 
essentially as described (te Riele et al, 1992). The targeting constructs \29Rb-neo (derived 
from the 1290LA genome) and BfcRb-neo (derived from the BALB/c genome) carry an 
insertion of the pMClneo marker in exon 19 of the Rb gene and differ approximately 0,6% at 
the nucleotide level (te Riele et al, 1992). 
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-Legends 

Fig. 1. Targeted disruption of the mouse Msh2 gene. 

A: Top: schematic diagram of the mouse Msh2 protein. ATP binding and HTH: putative 
ATPase and helix turn helix domains, respectively (Varlet et al t 1994). The position at which 
the protein is disrupted by insertion of the Hyg marker is indicated. Bottom: the genomic 
Msh2 locus; indicated below is the targeting construct carrying the Hyg marker inserted into a 
unique SnaB I site within an exon sequence of Msh2. The positions of the external probes 
used for detection of targedng events are indicated. 

B: The Msh2 locus in A/sA2+/+ cell line wt-2, Msh2+A cell line sAfa/z2-55 and Msh2-A cell 
line dMy/i2-9. Genomic DNA of these cell lines was digested with EcoRl and analyzed by 
Southern hybridization using probe 1 (Fig. 1A). Arrowheads indicate the positions of the 
wild type (wt, 20 kbp), and disrupted (ko, 7.5 kbp) alleles. 

Fig. 2. Mismatch-binding activiry in wild-type and Afc/i2-deficient cells. 
Binding activity in extracts of Msh2+/+ cell line wt-2 and Msh2-/- cell line dmsh2-9 to 
double-stranded oligonucleotides containing a mismatch (G*T, G.A, G.G, orA.C), an 
extrahelical dinucleotide (Extrahelical TG), or an extrahelical 14-nucleotide loop (LOOP 
14) and a Homoduplex was assessed using a gel retardation assay. Sequences of the- 
oligonucleotides are given under Experimental Procedures. Arrowheads indicate the positions 
of mismatch-specific complexes. E2F: Binding to an oligonucleotide carrying an E2F site 
served as a positive control (Beijersbergen et a], 1995). 

Fig. 3. Microsatellite stability in wild-type and Msh2 -deficient cells. 

Genomic DNA of 24 subclones derived from Msh2+/+ cell line wt-2 and Mshl-I- cell line 
dmsh2~9 was amplified by PCR using primer sets D7Mitl7 and D14Mitl5. Arrowheads 
indicate microsatellite alleles with a detectable length alteration. 

Fig. 4: Tolerance of A/s/:2-deficient cells to the simple methylating agent MNNG. 
ES cell lines wt-2 (M*A2+/+), sMsh2-55 (Msh2+f-) and dMsh2~9 {Msh2-I-) were exposed to 
increasing amounts of MNNG for one hour in the presence of 0 6 -benzylguanine. After four 
days of incubation, cells were trypsinized and counted. 
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Table 1. Homologous recombination has lost dependence on sequence identity in Msh2 
deficient ES cells. 



ES cells Homologous recombination Isogenic vs 

vs total no. G418 R col. with non-isogenic 



129ffib-neo B/cRb-neo 

BALB/c (Ms/?2+/+) 1/68 (1 .5%) 1 6/72 (22%) 1 5 x 

1290U\ {Msh2+I+) 33/94 (35%) 1/144 (0.7%) 50 x 

1290LA (Ms/72-/-) 42/185 (23%) 47/184 (26%) 0.9 X 

129 Rb-neo and BfcRb-neo are Rb targeting constructs, prepared from the 129 and BALB/c 
strains of mice, respectively, with 0.6% sequence divergency. Targeting frequencies in 
1290LA (Msh2+f+) ES cells are derived from te Riele et ai., 1992. 
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Genotype 



Age 

(months) 



Condition 
at birth 



Tumor incidence 



type 



frequency" age 

(months) 



Msh2+/- 



Msh2+/+::Msh2-/- 



Msh2-/- 



4-5 



healthy 0/110 

healthy Lymphoma 1/24 

healthy Lymphoma 6/19 

Histiocytic 1/19 
sarcoma 



2-4 
3.5 



*Chimeras consisting of wild-type and Msh2-I- cells 

**Number of mice carrying a tumor per total number in the experiment 
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